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ABSTRACT 

With a high spatial resolution imaging instrument such as the Chandra/ ACIS, one 
can confidently identify an X-ray source with only a few detected counts. The detec- 
tion threshold of such sources, however, varies strongly across the field-of-view of the 
instrument. Furthermore, the low detection counting statistics, together with a typical 
steep source number-flux relation, causes more intrinsically faint sources to be detected 
at apparently higher fluxes than the other way around. We quantify this "X-ray Ed- 
dington bias" as well as the detection threshold variation and devise simple procedures 
for their corrections. To illustrate our technique, we present results from our analysis 
of X-ray sources detected in the fields of the large-scale hierarchical complex Abell 2125 
at z = 0.247 and the nearby galaxy NGC 4594 (Sombrero). We show that the sources 
detected in the Abell 2125 field, excluding 10 known complex members, have a number- 
flux relation consistent with the expected from foreground or background objects. In 
contrast, the number-flux relation of the NGC 4594 field is dominated by X-ray sources 
associated with the galaxy. This galactic component of the relation is well characterized 
by a broken power law. 

Subject headings: methods: data analysis — methods: statistical — galaxies: clusters: 
individual (Abell 2125) — galaxies: individual (NGC 4594) — X-rays: general — X- 
rays: galaxies 



1. Introduction 

A classic problem in studying faint sources is the determination of both the threshold of their 
detection and the bias in their flux measurement. This problem is particularly acute in the study 
of discrete X-ray sources, most of which are detected with very limited counting statistics in a 
typical observation. Furthermore, the threshold may vary strongly across the fleld of view (FoV) of 
the photon-detecting instrument, depending on the point spread function (PSF; e.g., Jerius et al. 
2000) as well as the local background and effective exposure. The threshold variation also differs 
from one observation to another but is often overlooked or oversimplifled. 
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Closely related to the detection threshold is the bias in the measurement of source fluxes. 

Because of the statistical uncertainties in the photon counting, the fluxes are statistically over- 
estimated because there are typically far more truly faint sources than bright ones. So more sources 
are "up-scattered" to a given flux measurement than those that are "down-scattered", which is 
similar to the so-called Eddington bias in the optical photometry of faint objects (Eddington 1940; 
Murdoch, Crawford, k, Jauncey 1973; Hogg k, Turner 1998; Kenter &; Murray 2003 and references 
therein). Hogg & Turner (1998) prescribed a maximum likelihood (ML) correction for the bias as a 
function of the detection signal-to-noisc ratio (S/N), assuming a Gaussian-distributed error in the 
photometry. They further concluded that a source identified with S/N 4 or less is practically 
useless, because the bias would be too severe to allow for any reasonable estimation of the true 
flux. In X-ray astronomy, source detection thresholds corresponding to S/N ~ 3 or less are often 
used (e.g., Di Stefano et al. 2003). But because the counting error distribution here is typically 
Poissonian, rather than Gaussian, a S/N ratio alone does not directly translate to a false detection 
probability, which also depends on the local background contribution (Schmitt &: Maccacaro 1986). 
Therefore, it is important to check how this difference in the error distributions affects the Eddington 
bias. 

One can effectively treat both the threshold variation and the Eddington bias by calculating 
a redistribution matrix of the source flux measurement (Kenter &; Murray 2003). The analysis of 
the source number-flux relation, for example, is mathematically analogous to spectral fitting, for 
which sophisticated software packages such as XSPEC (Arnaud 1996) are publicly available and are 
widely used. With such a software package, one can effectively analyze the number-flux relation of 
X-ray sources, including the effects of both the threshold variation and the Eddington bias. Indeed, 
Kenter & Murray (2003) have recently illustrated this technique by using extensive ray-tracing and 
Monte Carlo simulations and by applying it to the wavelet source-detection analysis of a Chandra 
ACIS image. Here we describe a simple and yet general procedure that allows for the calculation 
of the redistribution matrix without resorting to the ray-tracing. 

To illustrate this procedure, we present the analysis of X-ray sources detected in two Chandra 
observations. The first is an observation of Abell 2125 (Fig. 1), a complex of galaxies and diffuse 
hot gas at z = 0.247. This 82 ksec observation has the FoV of 17' x 17' (including only the 2x2 
ACIS-I CCD array). Wang, Owen, & Ledlow (2004) have presented the main results from the 
observation, which are based partly on the analysis discussed here. The second observation is an 
18.5 ksec ACIS-S exposure of NGC 4594 (Sombrero), a nearly edge-on Sa galaxy at a distance of 8.9 
Mpc (Fig. 2). Only the data from the on-axis (# 7) back-illuminated chip with a FoV of 8^4x8^4 
are included. A study of the discrete X-ray sources detected with the same data has been reported 
by Di Stefano et al. (2003) and is compared in the present work. 

Our study does not account for all potential instrumental effects. In particular, multiple sources 
of small angular separations may produce a single detection, affecting not only the source number 
counting but also the shape of the number- flux relation (Hasinger et al. 1993). Fortunately, with 
the superb spatial resolution of an imaging instrument such as Chandra, the effect is typically not 
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important (< a few percent). For the observations analyzed here, we use the position-dependent 

90% energy-encircled radius as the detection aperture (EER; Jerius et al. 2000), which is a factor 
of 2 smaller than the source removal radius shown in Figs. 1 and 2b. Clearly, few sources are 
affected by overlapping detection apertures. 



2. Brief Discussion of X-ray Source Detection and Measurement 

Here we consider the detection of point-like X-ray sources based on an event count image (e.g.. 
Figs. 1 and 2b). Counts in such an image can be divided approximately into two components: one 
from individual X-ray sources and the other from a smoothly-distributed background, consisting of 
X-ray photons and charged particle-induced events. This backgroTind varies across the image, but 
typically on scales substantially greater than the size of the PSF (Jerius et al. 2000). 

Various source detection and analysis schemes (wavelet, sliding-box, and maximum likelihood 
centroid fitting) have been used in previous studies (e.g.. Freeman ct al. 2002; Harnden et al. 
1984; Cruddace et al. 1988). Applications of these schemes to Chandra imaging data have been 
described in Wang et al. (2003; 2004). To detect a source is to find a significant count number 
deviation from the expected statistical background fluctuation: the total count number ric within 
a certain detection aperture is compared with the expected contribution from the background rih 
(e.g., Fig. 3a), accounting for the statistical fluctuation, is typically estimated from the diffuse 
X-ray intensity in the region surrounding the detection aperture. Specifically, we first remove 
source candidates identified in the wavelet detection with a reduced threshold {S/N ~ 2) and 
then smooth the remaining image with a Gaussian kernel adjusted adaptively to achieve a spatially 
uniform count-to-noise ratio of ~ 10. The exposure correction for the intensity calculation accounts 
for such effects as the effective area variation, bad pixel/column removal, and CCD gaps. The 
resultant background intensity image is not sensitive to the exact source-removal threshold and 
smoothing method. The image, multiplied by the exposure map, gives the background count 
image (e.g.. Fig. 3a), from which nt, can be obtained within the corresponding detection aperture. 
The selected intensity-to-noise ratio in the smoothing is large enough so that the uncertainty in 
the background intensity estimate is small and may thus be neglected. The application of such 
a background map in a source search is sometimes called "the map detect algorithm", which is 
well-defined for a statistical analysis of the source detection. 

The significance of a count deviation above the background contribution rib can be characterized 
by the false detection probability P defined as 

Tic 1 fi 

P = l-y !^e-"\ (1) 

n=0 

If P < Pth, a preset threshold, one may declare a positive source detection. This single threshold is 
normally sufficient, except for the situation in which rib is extremely small (e.g., in regions partially 
covered by CCD gaps due to the telescope dithering; Fig. 1). Then an additional condition needs to 
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Fig. 1. — ACIS-I count image of Abell 2125 in the 0.5-8 keV band. Detected X-ray sources are marked with circles 
of twice the ~ 90% energy-encircled radii of the PSF (Jerius et al. 2000), estimated in the same band for a power 
law spectrum with photon index equal to 1.7 and Nh ~ 3 x 10^° cm~^. 

be imposed for a meaningful detection (sec further discussion in §3). The probability P, however, is 
for a single comparison only. In a typical blind search for sources, an entire X-ray image is scanned, 
invoking many statistically independent comparisons. Therefore, the false detection probability in 
such a search is much greater than P and depends on both the detection aperture and the size 
of the image as well as the choice of the threshold Pth- For example, searching in count images 
simulated by adding Poisson noises in the background image shown in Fig. 3a yields an average of 
- 0.05 and 0.6 fake detections for Pth = 10"^ and 10"^ 



-5- 




Fig. 2. — (a) 2MASS K-band image and (b) ACIS-S count image of NGC4594 in the 0.3-7 keV band. Detected 

X-ray sources are marked with circles of twice the 90% EER. Sources identified as globular clusters, stars, and central 
nucleus (all marked with x) are not included in the number-flux relation analysis (§4). 




Fig. 3. — (a) Smoothed background map of the count image as presented in Fig. 2b. The gray-scale is in the range 
from 0.01 (white) to 9.0 (black), (b) Source detection threshold map for Pth = 10~^, showing the minimum required 
number of net source counts (in the range of 4 - 40) within the position-dependent source-detection aperture and 
above the background as shown in (a). 
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Fig. 4. — Count rates of the sources detected in the 0.3-7 keV band versus their off-ajcis angles in the ACIS-S 
observation of NGC 4594. The curves illustrate the detection thresholds {Smin,k)- The upper curve is calculated via 
an azimuthal average, whereas the lower curve is obtained by choosing the lowest value in each concentric annulus 
around the aiming point of the observation. A similar plot for the Abell 2125 field is presented in Wang et al. (2004), 

For a detected source, one may estimate its count rate, S = Us/ (ryt), where ng = ric—nb, t, and 7] 
are respectively the net number of source counts, the effective exposure time, and the corresponding 
energy-encircled fraction of the detection aperture. The count rate can be easily converted into an 
energy flux if an X-ray spectrum is assumed. Thus the terms, count rate and energy flux, arc mostly 
interchangeable in the present work. The count rate threshold is S„iin.k = n's,min,k/i^'l'^k)i where 
ns^min is the minimum number of source counts required for a positive detection. Smin,k (oi' ns^min'i 
e.g.. Fig. 3b) generally decreases with increasing off-axis angle, chiefly because of increasing PSF 
size (e.g.. Figs. 1 and 2b; Jerius et al. 2000) and because of the decrease in the effective exposure 
(due to telescope vignetting). In the ACIS-S observation of NGC 4595 (Figs. 2 and 3), for example, 
the detection sensitivity is the highest at the aiming point (close to the axis of the telescope), which 
is about 2' southwest of the galaxy's center (Fig. 3b). The sensitivity is substantially lower in the 
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central region of the galaxy (Fig. 3b), because of the high diffuse X-ray background (Fig. 3a). The 
sensitivity is also low at large off-axis angles primarily due to relatively large PSF sizes, especially 
near the northeastern CCD boundaries (Fig. 3b). Fig. 4 presents the detected source count rates, 
compared with our calculated source detection thresholds. 

The count rate, or flux, of a detected source represents only a single realization of the intrin- 
sic source flux in the observation with limited counting statistics and is therefore subject to the 
Eddington bias. 

3. Maximum Likelihood Correction for the Eddington Bias 

Here we follow the approach described by Hogg k. Turner (1998) but assume the Poissonian 
counting statistics. The likelihood that a source with an intrinsic count rate S is observed to have 
a count rate So is (see also Schmitt & Maccacaro 1986) 

p(5^|5) = l!^i±^e-("^+"''). (2) 

Tic- 
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Fig. 5. — The dependence of the ML flux to the observed flux ratio on /3 = and 7 = The minimum Sml/So 
as a function of f3 is represented by the thick curve, whereas other curves from the left to the right are for 7 = 0.01, 
0.03, 0.1, 0.3, 1, 3, and 10. 

The likelihood p{S\So) for a true count rate S, given that this source is observed to have a 
count rate So, is related to p(>S'o|>S') by Hayes's theorem: 




p{S\So)^p{So\S)f{S), 



(3) 
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where f{S) is the underlying intrinsic number-flux relation in the count rate interval from S to 
S + dS. If the count rate measurement were unbiased, the peak in the likelihood function p{S\So) 
would be at S/Sq = 1. Because of the Eddington bias, however. So is statistically greater than S. 
Over a relatively small range of S, considered here for the bias correction, it is generally a good 
approximation to assume f{S) oc S~", where the exponent a is a constant and is typically around 
5/2 (the so-called Euclidean slope). We take the derivative of p{S\So) to find its ML peak position 
Sml, which leads to 

So 2^ ^)\'+[' 7(1-/3)2 J J' 

where P = ^ and 7 = Fig- 5 illustrates the dependence of the ratio on these two 
parameters; e.g., for a fixed 7 the ratio decreases with increasing (3. Evidently, the existence of the 
ML solution requires 

/3< 71/2/(2 + y/2) (5) 

or 

Us > a + {Aubay/^ . (6) 

If a = 5/2, 

n, >2.5 + (10n6)V2. (7) 

For example, Ug needs to be > 6, 4, or 3 for ^ 1, 0.1, or 0.01 (corresponding to log(P) = —3.2, 
—5.4, or —6.8; see Eq. 1). We force this condition to be satisfied for a positive source detection, 
in addition to P < Pj/j (§2). Otherwise, a detection with Ug = 1 might be claimed as a source if 
Hb is small enough (see Eq. 1), which could occur especially at edges of CCDs. The largest ML 
correction, or the smallest ratio is (see also Fig. 5) 

^ = ^(1-/3)- (8) 

If P (i.e., ns » a), the solution (4) then becomes 

This solution has the same appearance as the one derived by Hogg &: Turner (1998) based on an 

1/2 

assumed Gaussian error distribution. But Us/n^ here is the signal-to-background noise ratio, 
instead of the signal-to-noise ratio So/cr, where a is the Gaussian dispersion of So and is assumed 
to be a constant (Hogg &; Turner 1998). In X-ray astronomy, at least, the uncertainty in So 
depends on So itself, even when the Poissonian error distribution asymptotically approaches the 
Gaussian distribution for large Ug and n?,. Furthermore, the signal-to-background noise ratio is 
always substantially greater than the signal-to-noise ratio n^/ (ns-l-nft)^/^ for the same Ug- Therefore, 
adopting the Poissonian error distribution, which is appropriate for the X-ray counting statistics, 
we can apply the ML correction to individual sources detected with only a few counts, as long as 
Eq. 7 is satisfied. 
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4. Analysis of the Number-Flux Relation 

The goal of this analysis is to constrain the intrinsic (model) distribution f{S) over a broad 
range of S, based on an observed number-flux distribution. Integrating Eq. (3) with a proper 
normalization included gives the expected number of sources in the So to So + dSo interval 

POD 

N{So) = / RiSo,S)fiS)dS, (10) 
Jo 

where R{So, S) is the probability that a source with an intrinsic count rate S is expected to have a 
detected count rate in the interval. The above N{So) can be used directly in a ML analysis of the 
observed source number-flux distribution to constrain model parameters in f{S). 

However, it is often convenient to convert the above equation into a discrete expression with S 
and So divided into channels I and J of small count rate widths, A/ = Sf — S\ and Aj = Sj — Sj, 
where / and u denote the lower and upper count rate boundaries of the channels. We define the 
channels evenly on the logarithmic scale of source count rate. The overall spans of the channels 
should be large enough to cover the count rate range involved. For example, we choose the lowest 
/ channel to be a factor of 10 less than the lowest flux threshold of the source detection. By 
integrating over each channel J on both sides of Eq. 10, we get 



where 



and the redistribution matrix 



N{J) 



m 



RiJ,I) = 



^it;(j,/)/(/)A,, 

I 

/ f{S)dS/Ai 

j;i R{j,s)f{s)ds 

fsl f{S)dS 



(11) 



(12) 



(13) 



in which R(J, S) = / / R{So, S)dSo- In practice. A/ can be chosen to be small enough so that 



R{J,I) ^R{J,Si), 
where Si may be defined as Sj = (SiSu)^^'^- 



(14) 



As discussed in §3, the redistribution matrix is basically the Poisson probability distribution, 
although an average is required over the field from which the sources are detected. This step is 
similar to the construction of a weighted instrument redistribution function for the spectral analysis 
of a diffuse X-ray feature. In the present case, the average is needed because the source detection 
sensitivity varies across the field. The matrix defined at each pixel k can be expressed as 

if Sj < Smin,k 

RkiJ,Si) = l Qk{<S},Si)-Qk{<S'j,Si) iiS^j>Smin,k (15) 

Qfc(< Sj, Sj) — Qfc(< Sjnin^ki Si) if Sj < Smin^k < Sj 
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where Qk is the accumulated Poisson probability (see also Eq. 2). For example, 

f^max ( J. Q _|_ \Tl 

Qki< S:},Si) = £ i^^fc^^ + n;,,fc) ^-i,t,S,+n,,,)^ (Ig) 

71=0 ^' 

where Umax is the largest integer that is smaller than rji^Sj + Ub^k- When the expected value of 
rftkSj + n-b^k is large, Qk can be replaced by the standard error function of a Gaussian distribu- 
tion. The calculation does not need to be done for all J channels, as long as the range included 
covers essentially all the redistribution probability; a Aa range around each Sj is calculated in our 
applications. XSPEC, for example, allows for a redistribution matrix with a variable J channel 
dimension. The redistribution matrix averaged over all pixels is then 

Np 

R{J,I) = J2RkiJ,mk, (17) 
fc=i 

and 

Np 

J2^k = l, (18) 
k=l 

where is the intrinsic probability for a source to appear at the fcth pixel and may be inferred from 

their empirical spatial distribution, from a model, and/or from an image in another wavelength 
band (e.g., optical). For example, to estimate the total number of background sources in a field 
-^s,6) we can assume a uniform probability distribution (i.e., = 1/Np). On the other hand, we 
may assume that a source population of galaxy follows its near-infraxed light (e.g.. Fig. 2a) with 
an intensity Ik at pixel k. Under these assumptions, we have 

where A'^g is the total number of detected sources, background plus galactic. 

The above defined R{J, I) accounts for both the detection threshold variation and the flux 
redistribution of the sources. Fig. 6 shows an example of the redistribution matrix. As expected, 
the Eddington bias causes a systematic shift of the probability distribution peak (for a fixed S) to 
the right of So = S, apparent for S < 10~^ counts s~^. The dispersion in Sg also increases with 
decreasing S. Effects due to the discreteness of the Poissonian distribution are apparent at low 
So values. Fig. 7 illustrates these effects for three representative S values. Furthermore, the sum 
of R{J, I) over J channels gives the average detection probability as a function of intrinsic source 
count rate (e.g.. Fig. 8). 

R{J,I), together with the count rate boundaries of the / and J channels (all included in a 
standard fits file), can be imported into XSPEC. The only other file required is the histogram of 
the detected sources in the J channels. All other calculations, such as the binning of the model 
distribution, are performed automatically within XSPEC, which also allows for using or Cash's 
C-statistic in the model-fitting and for estimating model parameter uncertainties. 
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-3.5 -3.0 -2.5 -2.0 -1.5 
Log(So) 

Fig. 6. — Source count rate distribution matrix in the 0.3-7 keV band for the NGC 4594 observation (see also Figs. 
2-3; see also §5). The probability is scaled from 1 x 10~® (white) to 0.4 (black), logarithmically. Both S and So are 
in units of counts s~^. 

The model distribution f{S) can essentially be in any form, e.g., an additive and/or mul- 
tiplicative combination of various components. Different components may have different count 
rate-to-energy flux conversions, which depend on the line-of-sight absorption/scattering as well as 
the assumed intrinsic source spectra. Allowing for this flexibility is the primary reason for us to use 
count rates, instead of energy fluxes here. If a single conversion (i.e., a single incident spectrum) 
is assumed for all sources, expressing the procedure described above in terms of energy fluxes is 
trivial. 



5. Applications 

While the ML Eddington bias estimate for individual sources is straight forward, we con- 
centrate on the application of the source flux redistribution matrix. This procedure follows the 
standard data calibration, exposure map construction, source detection, and background image 
generation (e.g., Wang et al. 2003, 2004; see also Figs. 1-3). We adopt Pth = 10~^. From the 
ACIS-I observation of Abell 2125, we detect 81, 48, and 93 sources in the 0.5-2, 2-8, and 0.5-8 
keV bands, respectively. From the ACIS-S observation of NGC 4594, 80, 65, and 112 sources are 
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Fig. 7. — Illustration of the average source flux redistribution probability in the NGC 4594 observation. The vertical 
straight lines mark the selected intrinsic source fluxes {S), whereas the curves represent the probability distribution of 
the corresponding flux (So)- Notice that the solid line marks an S value that is below the lowest detection threshold 
of ~ 3 X 10"'* counts s~^. 



detected in the 0.3-1.5, 1.5-7, and 0.3-7 keV bands, respectively. After removing duplicates (spatial 
coincidences) in the detections, we find a total of 99 and 115 unique sources in the Abell 2125 and 
NGC 4594 fields (Figs. 1 and 2b). We then apply the procedure described in §4 to generate the 
required Atix redistribution matrix R{J,I) for each source detection band (e.g., Fig. 6), which can 
then be used to study the number-flux relations in the flelds. 



5.1. Contribution from Interlopers 

In a typical Chandra imaging observation, a considerable fraction, if not most, of the detected 
sources are interlopers. In a high Galactic latitude field, interlopers are typically background AGNs, 
plus a small number of foreground stars. The number-flux relation of such interlopers has been 
characterized in various X-ray surveys, including the Chandra ACIS-I deep surveys (Moretti et al. 
2003 and references therein). In almost all recent work, this characterization is done separately in 
the 0.5-2 kcV and 2-10 keV bands. The source populations detected in these two bands only partially 
overlap: Sources with steep X-ray spectra preferentially appear in the 0.5-2 keV band, whereas 
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Fig. 8. — Average detection probability of a source as a function of its intrinsic count rate in the NGC 4594 
observation. 



highly-absorbed ones appear in the 2-10 kcV band. In general, Chandra is much more sensitive 
to sources in the former band than in the latter band. Therefore, we focus on the accumulated 
number-flux relation in the 0.5-2 keV band, which can be modeled as: 



(2 X 10-^5)"! 



S) = No J, . , (20) 



where A^o = 6150 sources deg~^, ai = 1.82, a2 = 0.60, and = 1.48x10"^^ erg s~^ cm"^ (Moretti 
et al. 2003). We have implemented this model in XSPEC, which automatically computes /(/) = 
[A^(> S\) — N{> Sf)]A, where A in units of deg^ is the FoV of the observation in consideration. 
By convolving /(/) with R{J,I), we may predict the number-flux relation of the interlopers (Eq. 
11). 

However, caution must be exercised in using Eq. 20. The energy flux S specifically depends 
on the count rate-to-energy flux conversion that assumes a power law with a photon index of 1.4 
plus a foreground absorption of 1.6 x lO^'^ cm~^. Indeed, the power law is a good description 
of the overall spectral shape of the extragalactic X-ray background, which is now believed to be 
a composite of mostly AGNs. But, X-ray spectral properties do vary greatly from one source to 
another. Statistically, the average X-ray spectrum of AGNs tends to become flatter with decreasing 
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Fig. 9. — The on-axis count rate ratio in different bands versus X-ray-absorbing gas column density for a repre- 
sentative power-law models, (a) The ACIS-I ratios: 0.5-8 kcV to 0.5-2 kcV (thick curves) and 0.5-8 kcV to 2-8 kcV 
(thin curves), (b) The ratios: ACIS-S 0.3-7 keV to ACIS-I 0.5-2 keV (thick curves) and ACIS-S 0.3-7 keV to ACIS-I 
2-8 keV (thin curves). The three styles of the curves correspond to photon index choices: 1.4 (solid), 1.7 (dotted), 
and 2 (dashed). 

flux, apparently due to an increasing population of highly-absorbed AGNs. Unfortunately, such 
spectral variations are not yet well quantified and thus cannot be incorporated easily into the 
calculation of the count rate-to-energy flux conversion. Obviously, a deviation from the assumed 
source spectrum could cause errors. To minimize this effect, we convert the energy flux in the model 
back into the count rate in the 0.5-2 keV band of an ACIS-I observation, using the same assumed 
power law, but with the foreground absorption appropriate to a particular field. In the case of 
the Abell 2125 observation, for example, we use Nfj = 3 x lO^*' cm^^ to derive the conversion as 
4.5 X 10~^^ (erg cm~^ s~^)/(counts s~^), which leads to a predicted number-fiux relation of the 
interlopers, as is shown in Wang et al (2004). 

To construct the observed number-fiux relation, we exclude the 10 sources that have been found 
to be in positional coincidence with Abell 2125 complex member galaxies (Wang et al. 2004). These 
members are all detected in both 0.5-2 keV and 0.5-8 keV bands. We construct the observed relation 
in the 0.5-2 keV band by grouping the remaining 71 sources to have a minimum number of ~ 8 
sources per count rate bin. A direct comparison between the observed and predicted relations (not 
a fit) gives a C-statistic value of 6.9 for a total of 9 bins. The probability to have a C-statistic 
less than this value is 38%, according to Monte-Carlo simulations in XSPEC. Therefore, the two 
relations are statistically consistent with each other. 

A source number-fiux relation comparison may also be made for detections in other bands. This 
comparison is particularly desirable for the ACIS-I 0.5-8 keV band, in which the source detection 
is typically most sensitive. However, we are not aware of any number-flux relation constructed 
recently for interlopers in this band. Indirectly, one may still use Eq. 20 by converting the flux 
to the count rate in the 0.5-8 keV band. This approach implicitly uses the ratio of the 0.5-8 keV 
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to 0.5-2 keV count rates, which depends on an assumed source spectrum (Fig. 9a). For a typical 
AGN spectrum with a power-law photon index between 1.4-2 and an absorption smaller than 
~ 10^^ cm^^, a good approximation for the ratio is 1.3. With this approximation, the comparison 
between the observed and expected number-flux relations of the Abell 2125 sources in the 0.5-8 
keV band gives a C-statistic value of 18.4 for 10 bins, which may be rejected with a confidence of 
^ 94%. The predicted number of sources is 75, compared to 83 actual detections (the complex 
members are excluded). This difference is partly due to the expected presence of relatively hard 
X-ray sources, which are not detected in the 0.5-2 kcV band and are thus not included in the 
number-flux relation extrapolated from the same band. Unfortunately, the poor number statistics 
of such sources detected in the field prevents us from placing a tight constraint on their population. 
In addition, one also expects an intrinsic cosmic variance in the source number density from one 
field to another, typically > 6%, depending on the FoV, energy band, and source detection limit 
of observations (see Yang et al. 2003 and references therein). Therefore, even in a relatively 
deep exposure such as the Abell 2125 observation, the number of interlopers cannot be predicted 
accurately. 

The relative uncertainty in the number of interlopers becomes even greater for an ACIS-S 
observation, because of both its small FoV (at least for the BI CCD #7 chip alone) and its different 
energy response from the FI CCDs of the ACIS-I. There is not yet a number-flux relation constructed 
for ACIS-S detected interlopers. Nevertheless, an approximate estimate is often required. One may 
use an approach similar to the one described above for the ACIS-I 0.5-8 keV band. By adopting a 
ratio of the ACIS-S 0.3-7 kcV to ACIS-I 0.5-2 keV count rates as - 2.2 (Fig. 9b), for example, we 
find that the expected number of interlopers is Ns^b ~ 16 in the NGC 4594 observation, compared 
with 112 sources detected in the 0.3-7 keV band. Although the relative uncertainty in the expected 
number of interlopers is statistically large, the bulk of the detected sources are clearly associated 
with the galaxy. 



5.2. Number-Flux Relation of X-ray Sources in NGC 4594 

To facilitate a comparison with the analysis by Di Stefano et al. (2003), we also exclude 
sources that have been identified as foreground stars as well as the nucleus and globular clusters of 
NGC 4594. Fig. 10 presents the number-flux relation for the remaining Ng = 90 sources detected 
in the 0.3-7 keV band. We calculate the re-distribution matrix, by using Eq. 19 with Ng^^ = 16 
as estimated above and the 2MASS K-band image (Fig. 2a) as the galactic source probability 
distribution. Foreground stars in the image are removed approximately with a 9x9 pixel median 
filter (pixel size = 1''5). The predicted interloper contribution is small and is thus fixed in the 
subsequent number-flux analysis. First, we model the galactic component, using a single power law 
and get the best fit as 

Log[iV(S)] = -l.elO;! - 2.llgi^Log(5) (21) 
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(all error bars are at the la confidence level); the C-statistic value 17.0 for 9 degrees of freedom can 

only be rejected statistically at a confidence 92%. Considering all the additional uncertainties 
that arc not included in the analysis (e.g., interloper subtraction errors), we consider the single 
power law fit is still reasonably acceptable. 

Next, we fit the observed number-flux relation with a broken power law (see also Di Stefano 
et al. 2003), 

■"^^ N.(iy\ (22) 



dS J \SbJ 

where F = Fi for S < Sj, and T = T2 iov S > Sf,. With an addition of only two more model 
parameters than in the single power law, this model fit gives a C-statistic value of 8.87, which 
is satisfactory (Fig. 10). The best-fit parameters are Ni, = 1.7 x 10^ sources/(counts s~"^), Sb = 



2.ll^;^ X 10" 



14+0-3 p 
-■-•^-0.3' ^ 



4.6 



+2.7 

-1.2- 



1.7 

Fig. 11 presents the confidence contours, Fi vs. F2. 
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Source count rate (counts/s) 

Fig. 10. — Observed differential number-flux relation of the sources detected in the 0.3-7 keV band, compared with 
the best-fit broken power-law model (dash-dotted histogram) plus the expected interloper contribution (dashed) in 
the field of NGC 4594. The combination of the two model components is represented by the solid histogram. 



In comparison, the analysis by Di Stefano et al. (2003) suggests Fi = 0.74^5 04 > ^2 



9 c;o+0.35 



+0.40 
-0.32 



and a luminosity break at 9^16 ^ 1^^^ erg s~^, corresponding to a count rate break Sj, ~ 1.8 
10~^ counts s~^, assuming a power law spectrum of photon index 2 and an absorption of Nh = 
5 X 10^° cm~^ (Di Stefano et al. 2003). While these parameters are marginally consistent with 
those obtained from our analysis, their constraints on Sj, seem to be considerably tighter than ours. 

This difference is probably largely due to the simplification made in the analysis by Di Stefano 
et al. (2003). Their analysis ignored both the background source contribution and the Eddington 
bias and assumed a single source-detection threshold of 4.5 x 10^^ erg s~^, or a count rate of 
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Fig. 11. — The 68%, 90%, and 99% confidence contours of the broken power law indexes from the best fit shown 
in Fig. 10. 

9 X 10~^ counts s"'^, inferred from the peak at S/N ~ 3.2 in a immher-S/N histogram of the 
detected sources. However, Fig. 8 shows that the average detection probability is still only 80% 
at this count rate. A detection probability > 95%, for example, requires a source count rate 
> 1.4 X 10~^ counts s"-*^. Clearly, part of the turn-over in the observed number-flux relation at Sb 
(Fig. 10) is due to the flux-dependent source detection threshold, which should be accounted for in 
the modeling. Setting a threshold at a higher count rate should have reduced this flux dependence. 
But more than half of the sources would then be excluded in the analysis. 

Of course, the flux-dependence of the threshold is a result of the detection sensitivity variation 
across the field. The variation is even greater for an ACIS-I observation, because of the larger 
off-axis FoV. Therefore, a proper correction for the flux-dependent threshold is essential to the 
analysis of the source number-flux relation, especially to the full utilization of detected sources. 

6. Summary 

We have examined the detection threshold and the Eddington bias in flux measurements for 
sources identifled with limited Poisson counting statistics. Our main results are as follows: 

• We have derived a ML correction for the flux measurement bias, based on a Poissonian error 

distribution. This correction can be applied to individual sources detected with very limited 
counting statistics. The condition for the presence of the ML correction (Eq. 7) should be 
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required for a source detection, in addition to the probability threshold. 

• The source detection threshold varies strongly with position, especially with the ofF-axis angle. 
A flux-dependent correction for this threshold variation is essential to the statistical analysis 
of detected sources. 

• We have developed a simple procedure to calculate the flux redistribution matrix, based on 
the exposure and diffuse background maps obtained directly from an observation. The matrix 

inherently accounts for both the source detection threshold variation and the Eddington bias 
and can be imported into a software package such as XSPEC so that the number-flux relation 
can be analyzed as if it were an X-ray spectrum. 

• We have applied the procedure to a deep ACIS-I observation of the AbcU 2125 complex. With 
10 identified complex members excluded, the remaining X-ray sources show a number-flux 
relation that is consistent with the expected interlopers of the field. 

• We have also analyzed the number-flux relation of X-ray sources detected in an ACIS-S 
observation of NGC 4594. While the contribution from the expected interlopers is shown 
to be small, we have characterized the number-flux relation of the galactic sources, using 
either a power law or a broken power law. The latter model gives a considerably better 
characterization than the former. 

The techniques described in this paper can be easily implemented; an IDL procedure for 
generating the flux-redistribution matrix, for example, can be obtained from the author. 

David Smith is acknowledged for help in writing programs that convert the calculated redistri- 
bution matrix and the background source number-flux relation into the XSPEC-required formats. 
This work was funded by NASA under the grants NAG5-10783, NAG5-9429, NAG5-8999, and 
GOl-2126. 
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